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Abstract 

We show the diagonal problem for higher-order pushdown automata (HOPDA), and 
hence the simultaneous unboundedness problem, is decidable. From recent work by Zet- 
zsche this means that we can construct the downward closure of the set of words accepted 
by a given HOPDA. This also means we can construct the downward closure of the Parikh 
image of a HOPDA. Both of these consequences play an important role in verifying concur¬ 
rent higher-order programs expressed as HOPDA or safe higher-order recursion schemes. 


1 Introduction 

Recent work by Zetzsche |40j has given a new technique for computing the downward closure 
of classes of languages. The downward closure J(£) of a language C is the set of all subwords 
of words in C (e.g. aa is a subword of babab). It is well known that the downward closure is 
regular for any language m- However, there are only a few classes of languages for which it 
is known how to compute this closure. In general it is not possible to compute the downward 
closure since it would easily lead to a solution to the halting problem for Turing machines. 

However, once a regular representation of the downward closure has been obtained, it can 
be used in all kinds of analysis, since regular languages are well behaved under all kinds of 
transformations. For example, consider a system that waits for messages from a complex 
environment. This complex environment can be abstracted by the downward closure of the 
messages it sends or processes it spawns. This corresponds to a lossy system where some 
messages may be ignored (or go missing), or some processes may simply not contribute to the 
remainder of the execution. In many settings - e.g. the analysis of safety properties of certain 
kinds of systems - unread messages or unscheduled processes do not effect the precision of 
the analysis. Since many types of system permit synchronisation with a regular language, this 
environment abstraction can often be built into the system being analysed. 

Many popular languages such as JavaScript, Python, Ruby, and even C-I--I-, include higher- 
order features - which are increasingly important given the popularity of event-based programs 
and asynchronous programs based on a continuation or callback style of programming. Hence, 
the modelling of higher-order function calls is becoming key to analysing modern day programs. 

A popular approach to verifying higher-order programs is that of recursion schemes and sev¬ 
eral tools and practical techniques have been developed [531 IMl HSl HH IMl ISl El El] • Recursion 
schemes have an automaton model in the form of collapsible pushdown automata (CPDA) [IB] 
which generalises an order-2 model called 2-PDA with links [1] or, equivalently, panic au¬ 
tomata [22]. When these recursion schemes satisfy a syntactical condition called safety, a 
restriction of CPDA called higher-order pushdown automata (HOPDA or n-PDA for order-n 
HOPDA) is sufficient [531 HI]. HOPDA can be considered an extension of pushdown automata 
to a “stack of stacks” structure. It remains open as to whether CPDA are strictly more expres¬ 
sive than nondeterministic HOPDA when generating languages of words. It is known that, at 
order 2, nondeterministic HOPDA and CPDA generate the same word languages [1]. However, 
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there exists a language generated by a deterministic order-2 CPDA that cannot be generated 
by a deterministic HOPDA of any order m- 

It is well known that concurrency and first-order recursion very quickly leads to undecid¬ 
ability (e.g. [33]). Hence, much recent research has focussed on decidable abstractions and 
restrictions (e.g. [TU S] [301 (37] [T311371133| [TUI (HI)- Recently, these results have been extended 
to concurrent versions of CPDA and recursion schemes (e.g. [33] [331 [13133]). Many approaches 
rely on combining representations of the Parikh image of individual automata (e.g. [njdzKii]). 
However, combining Parikh images of HOPDA quickly leads to undecidability (e.g. [U])- In 
many cases, the downward closure of the Parikh image is an adequate abstraction. 

Computing downward closures appears to be a hard problem. Recently Zetzsche introduced 
a new general technique for classes of automata effectively closed under rational transductions 
- also referred to as a full trio. For these automata the downward closure is computable iff the 
simultaneous unboundedness problem (SUP) is decidable. 

Definition 1.1 (SUP [IS])- Given a language £ C a);... a* does ),(£) = a);... a* ? 

Theorem 1.1. 14 0[ Theorem 1] Let C be class of languages that is a full trio. Then downward 
closures are computable for C if and only if the SUP is decidable for C. 

Zetzsche used this result to obtain the downward closure of languages definable by 2-PDA, 
or equivalently, languages definable by indexed grammars [2]. Moreover, for classes of languages 
closed under rational transductions, Zetzsche shows that the simultaneous unboundedness prob¬ 
lem is decidable iff the diagonal problem is decidable. The diagonal problem was introduced by 
Czerwihski and Martens m- Intuitively, it is a relaxation of the SUP that is insensitive to the 
order the characters are output. For a word ic, let |rc|a be the number of occurrences of a in w. 

Definition 1.2 (Diagonal Problem [H]). Given language L we define 

Diagonal^^ = \fm.3w € £.V1 <i< a.\w\ai > m . 

The diagonal problem asks if Diagonalg^^ oc(^) holds of C. 

Corollary 1.1 (Diagonal Problem and Downward Closures). Let C be class of languages that 
is a full trio. Then downward closures are computable for C if and only if the diagonal problem 
is decidable for C. 

Proof. The only-if direction follows from Theorem 11.11 since given a language £ C a]'... a* the 
diagonal problem is immediately equivalent to the SUP. In the if direction, the result follows 
since C satisfies the diagonal problem iff ),(£) also satisfies the diagonal problem. Since the 
diagonal problem is decidable for regular languages and ),(£) is regular, we have the result. □ 

In this work, we generalise Zetzsche’s result for 2-PDA to the general case of n-PDA. We 
show that the diagonal problem is decidable. Since HOPDA are closed under rational transduc¬ 
tions, we obtain decidability of the simultaneous unboundedness problem, and hence a method 
for constructing the downward closure of a language defined by a HOPDA. 

Corollary 1.2 (Downward Closures). Let P be an n-PDA. The downward closure ).(£(£)) is 
computable. 

Proof. From Theorem 16.31 (proved in the sequel), we know that the diagonal problem for 
HOPDA is decidable. Thus, using Corollary 11.11 we can construct the downward closure of 
P. □ 
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This result provides an abstraction upon which new results may be based. It also has several 
immediate consequences: 

1. decidability of separability by piecewise testable languages, which follows from from Cz- 
erwihski and Martens m, 

2. decidability of reachability for parameterised concurrent systems of HOPDA communi¬ 
cating asynchronously via a shared global register, from La Torre et al. [55] . 

3. decidability of finiteness of a language defined by a HOPDA, and 

4. computability of the downward closure of the Parikh image of a HOPDA. 

We present our decidability proof in two stages. First we show how to decide Diagonal(j(P) 
for a single character and HOPDA P in Sections [3] and E) In Sections [5j [6j and [7] we generalise 
our techniques to the full diagonal problem. 

In Section O we give an outline of the proof techniques for deciding Diagonal£,(P). In 
short, the outermost stacks of an n-PDA are created and destroyed using push„ and pop„ 
operations. These push„ and pop„ operations along a run of an n-PDA are “well-bracketed” 
(each push„ has a matching pop„ and these matchings don’t overlap). The essence of the idea 
is to take a standard tree decomposition of these well-bracketed runs and observe that each 
branch of such a tree can be executed by an (n — 1)-PDA. We augment this (n — 1)-PDA with 
“regular tests” that allow it to know if, each time a branch is chosen, the alternative branch 
could have output some a characters. If this is true, then the (n — 1)-PDA outputs a single 
a to account for these missed characters. We prove that, although the (n — 1)-PDA outputs 
far fewer characters, it can still output an unbounded number iff the n-PDA could. Hence, by 
repeating this reduction, we obtain a 1-PDA, for which the diagonal problem is decidable since 
it is known how to compute their downward closures [SilS]. 

In Section lSTTl we outline the generalisation of the proof to the full problem Diagonal^j^ oc (-P) ■ 
The key difficulty is that it is no longer enough for the (n — 1)-PDA to follow only a single branch 
of the tree decomposition: it may need up to one branch for each of the oi,..., Oq,. Hence, we 
define HOPDA that can output trees with a bounded number (a) of branches. We then show 
that our reduction can generalise to HOPDA outputting trees (relying essentially on the fact 
that the number of branches is bounded). 

2 Preliminaries 

2.1 Downward Closures 

Given two words w = € S* and w' = ai.. .ui € E* for some alphabet E, we write 

w < w' iS there exist H < ... < im such that for all 1 < j < m we have = Oi-. Given a set 
of words £ C E*, we denote its downward closure 4-(£) = {w | w < re' € £}. 

2.2 Trees 

A E-labelled finite tree is a tuple T = (D, A) where E is a set of node labels, and D C N* is 
a finite set of nodes that is prefix-closed, that is, r]6 € D implies rj € D, and A : £> —> E is a 
function labelling the nodes of the tree. 

We write £ to denote the root of a tree (the empty sequence). We also write 

a[Ti,.. .,Tm] 
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to denote the tree whose root node is labelled a and has children Ti,..., T^. That is, we define 
a[Ti,..., Tm] = {D', A') when for each 6 we have Ts = {Ds, Xs) and D' = {Srj \ rj € Ds}D {e} 
and 

AU) = I“ " . 

\Xsiv') V = Sri' 

Also, let T[a] denote the tree ({e}, A) where A(£) = a. A branch in T = {D, A) is a sequence 
of nodes oi T, iji ■ ■ ■ rjn, such that rji = e, ijn = di 62 • ■ • Sn-i is maximal in D, and = rjj Sj 
for each 1 < j < n — 1 . 

2.3 HOPDA 

HOPDA are a generalisation of pushdown systems to a stack-of-stacks structure. An order-n 
stack is a stack of order-(n — 1) stacks. An order-n push operation pushes a new order-(n — 1) 
stack onto the stack that is a copy of the existing topmost order-(n—1) stack. Rewrite operations 
update the character that is at the top of the topmost stacks. 

Definition 2.1 (Order-n Stacks). The set of order-n stacks over a given stack alphabet P 
is defined inductively as follows. 

= T 

S^+l = {[si.-.Sm]fc+i |Vt.S*e5[} . 

Stacks are written with the top part of the stack to the left. We define several operations. 


tOPfc([si . . 
tOPfc([si . . 

Sm\k) 

■^rn\n) 

= Si 

= tOPfc(si) 

n > k 

rew.y([7i.. 

rew.y ([si.. 

7m] 1 ) 
Sm]„) 

= [7 72---7m]i 

= [rew^(si) S 2 .. .Sm]„ 

n > 1 

pushfe([si.. 

pushj,([si.. 

Sm]^) 

■^rn\n) 

= [si Si . . . Sm\k 
= [pushfc(si) S 2 , .. . ,Sm]„ 

n > k 

POPfc([si .. 
POPfc([si .. 

Sm]if) 

•^mjn) 

= [s2 . • • Smjfc 

= [pOPfc( Si) S2, . . . ,Sm]„ 

n > k 


and set 

Ops„ = {rew.y I 7 e P} U {push^,,pop^ | 1 < fc < n} 

to be the set of order-n stack operations. 

For example 

push2([[7cr]J2) = [[7ct]i [7ct]i]2 

rew,^([[ 7 Cr]^[ 7 Cr]J 2 ) = [[cr cr]i [7 O-JJa ■ 

Definition 2.2 (HOPDA or n-PDA). An order-n higher order pushdown automaton (HOPDA 
or n-PDA) is given by a tuple (7^, S, P, 7^, J^,Pin, 7 in) where V is a finite set of control states, 
Tj is a finite output alphabet (that contains the empty word character e), T is a finite stack 
alphabet, T^CT^xPxEx Ops.,^ xV is a set of transition rules, T is a set of accepting control 
states. Pin &V is the initial control state, and 7 in € P is the initial stack character. 
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We write (p, 7 ) A {p',o) for a rule {p,^,a,o,p') € TZ. 

A configuration of an n-PDA is a tuple (p, s) where p & V and s is an order-n stack over 
r. We have a transition (p, s) A {p',s') whenever we have (p, 7 ) A- {p',o), top 2 (s) = 7 , and 
s' = o{s). 

A run over a word w S S* is a sequence of configurations cq -A- • • • Cm such that 
the word ai... am is w. It is an accepting run if cq = (pin, hinL) — where we write | 7 |„ for 
[■ ■ ■ [tIi ■ ■ ■]„ — where Cm = (p, s) with p ^ T. Furthermore, for a set of configurations C, 
we define 

PreJ,(C) 

to be the set of configurations c such that there is a run over some word from c to c' G C. When 
C is defined as the language of some automaton A accepting configurations, we abuse notation 
and write Prep (A) instead of Prep(£(A)). 

For convenience, we sometimes allow a set of characters to be output instead of only one. 
This is to be interpreted as outputing each of the characters in the set once (in some arbitrary 
order). We also allow sequences of operations oi;...; Om in the rules instead of single operations. 
When using sequences we allow a test operation 7 ? that only allows the sequence to proceed 
if the top]^ character of the stack is 7 . All of these extensions can be encoded by introducing 
intermediate control states. 


2.3.1 Regular Sets of Stacks 

We will need to represent sets of stacks. To do this we will use automata to recognise stacks. 
We define the stack automaton model of Broadbent tt al. [ 8 ] restricted to HOPDA rather 
than CPDA. We will sometimes call these bottom-up stack automata or simply automata. The 
automata operate over stacks interpreted as words, hence the opening and closing braces of the 
stacks appear as part of the input. We annotate these braces with the order of the stack the 
braces belong to. Let F[] = {[n-i, • ■ • > [ 1 , ]i) • ■ • Jn-i} O T. Note, we don’t include since 
these appear exclusively at the start and end of the stack. 

Definition 2.3 (Bottom-up Stack Automata). A tuple A is a bottom-up stack automaton when 
A is (Q,F, gin, Qf,A) where Q is a finite set of states, T is a finite input alphabet, q-m € Q is 
the initial state and A : (Q x T) ^ Q is a deterministic transition function. 

Representing higher order stacks as a linear word graph, where the start of an order-fc stack 
is an edge labelled and the end of an order-/c stack is an edge labelled ]/c, a run of a bottom-up 
stack automaton is a labelling of the nodes of the graph with states in Q such that 

1 . the rightmost (final) node is labelled by gin, and 

2 . whenever we have for any 7 G F[], and pair of labelled nodes with an edge g A g' then 
g = A(g',7). 

The run is accepting if the leftmost (initial) node is labelled by g G Qf- An example run over 
the word graph representation of [[[7 [o']i ]2 [Mi] 2]3 given in Figure [TJ 

Let C{A) be the set of stacks with accepting runs of A. Sometimes, for convenience, if we 
have a configuration c = {p, s) of a HOPDA, we will write c G C{A) when s G C{A). 


5 


Unboundedness of HOPDA 


M. Hague J. Kochems C.-H. L. Ong 


[2 [l 7 cr ]i [1 cr ]i h [2 [1 a ]l h 

Qf — 913 — 912 — 9ll — 910 — 99 — 98 — 97 — 96 — 95 — 94 — 93 — 92 — 9l — 9in 

Figure 1: A run over [[[7 cr]^ [cr]J 2 [MJsla 

3 The Single Character Case 

We assume S = {a, e} and use b to range over E. This can be obtained by simply replacing 
all other characters with e. We also assume that all rules of the form (p, 7 ) A- {p',o) with 
o = push„ or o = pop„ have b = e. We can enforce this using intermediate control states to 
first apply o in one step, and then in another output b (the stack operation on the second step 
will be rew^ where 7 is the current top character). We start with an outline of the proof, and 
then explain each step in detail. 

For convenience, we assume acceptance is by reaching a unique control state in J- with an 
empty stack (i.e. the lowermost stack was removed with a pop„ and J- = {p/})- This can 
easily be obtained by adding a rule to a new accepting state whenever we have a rule leading 
to a control state in From this new state we can loop and perform pop„ operations until 
the stack is empty. 

3.1 Outline of Proof 

The approach is to take an n-PDA P and produce an (n — 1)-PDA P_i that satisfies the 
diagonal problem iff P does. The idea behind this reduction is that an (accepting) run of P can 
be decomposed into a tree with out-degree at most 2 : each push„ has a matching pop„ that 
brings the stack back to be the same as it was before the push„; we cut the run at the pop„ and 
hang the tail next to the push„ and repeat this to form a tree from a run. This is illustrated in 
Figure [5] where nodes are labelled by their configurations, and the push„ and pop„ points are 
marked. The dotted arcs connect nodes matched by their pushes and pops - these nodes have 
the same stacks. Notice that at each branching point, the left and right subtrees start with 
the same order-(n — 1) stacks on top. Notice also that for each branch, none of its transitions 
remove the topmost order-(n — 1) stack. Hence, we can produce an (n — 1)-PDA that picks 
a branch of this tree decomposition to execute and only needs to keep track of the topmost 
order-(n — 1) stack of the n-PDA. When picking a branch to execute, the (n — 1)-PDA outputs 
a single a if the branch not chosen could have output some a characters. We prove that this is 
enough to maintain unboundedness. 

In more detail, we perform the following steps. 

1. Instrument P to record whether an a character has been output. Then, using known 
reachability results, obtain regular sets of configurations from which the current top„ stack 
can be popped, and moreover, we can know whether an a is output on the way. These 
tests can be seen as a generalisation of pushdown systems with regular tests introduced 
by Esparza et al. m- 

2 . From an n-PDA P, we define an (n — I)-PDA with tests P_i and then an (n — 1)-PDA 
P' such that 

Diagonal(j(P) Diagonal(j(P') . 

The tests will be used to check the branches of the tree decomposition not explored by 

P-i. 
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(pu [si]„) 


(P2, [S2]„) 

push„ 

{P3, [S2 S2]„) 


(P2, [S2]„) 


(P3, [S2 S2]„) 


(P8, [S2]„) 


{P4, [S3 S2]„) 
push„ 
(P5, [S3 S3 S2]„) 

{pe, [S4 S3 S2]„) 
POP„ 
(P7, [S3 S2]„) 
POP„ 
(P8, [S2]„) 


(P4, [S3 S2]„) 


{P9, [S5]„) 


{P5, [S3 S3 S2]„) {P7, [S3 Sa]^) 

{pe, [S4 S3 S2]„) 


(P9> [S5]„) 

(a) a run of P with pushes and pop^s C") The tree decomposition of the run 

marked. 


Figure 2: Tree decompositions of runs. 


3. By repeated applications of the above reduction, we obtain an 1-PDA P for which 
Diagonal^j (P) is decidable since the downward closure of a context-free grammar (equiv¬ 
alent to 1-PDA) is computable [321IH] and this is equivalent to the diagonal problem. 

The {n — 1)-PDA with tests P_i will simulate the n-PDA P in the following way. 

• All operations except for push„ and pop„ will be simulated directly. 

• In lieu of performing a push„, P_i will choose to simulate the run of P between the push 
and its corresponding pop„, or the run of P after the corresponding pop„ has taken place. 

— Tests will be used to determine which control state could appear after the corre¬ 
sponding pop„. 

— If the part of the run not being simulated output some as, then P will output a 
single a in place of the omitted as. 


Although P_i will output far fewer a characters than P (since it does not execute the full run), 
we show that it still outputs enough as for the language to remain unbounded. 
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We thus have the following theorem. 

Theorem 3.1 (Decidability of the Diagonal Problem). Given an n-PDA P and output char¬ 
acter a, whether Diagonal^{P) holds is decidable. 

Proof. We construct via Lemma lT^ an {n — 1)-PDA P' such that DiagonalQ(P) iff Diagonal(j(P') 
We repeat this step until we have a 1-PDA. It is known that Diagonal^(P) for an 1-PDA is 
decidable since it is possible to compute the downward closure [MllS]- □ 

3.2 HOPDA with Tests 

When executing a branch of the tree decomposition, to be able to ensure the branch is correct 
and whether we should output an extra a we need to know how the system could have behaved 
on the skipped branch. To do this we add tests to the HOPDA that allow it to know if 
the current stack belongs to a given regular set. We show in the following sections that the 
properties required for our reduction can be represented as regular sets of stacks. Although we 
take Broadbent et aZ.’s logical reflection as the basis of our proof, HOPDA with tests can be 
seen as a generalisation of pushdown systems with regular valuations due to Esparza et al. |12] . 

Definition 3.1 (n-PDA with Tests). Given a sequence of automata Ai,...,Am recognising 
regular sets of stacks, an n-PDA with tests is a tuple P = (P, E,r,7?., P, pin,7in) where 
P,E,r,P, Pin, and yin are as in HOPDA, and 

TZ C V X r X {Al,..., Am} X E X Ops^ x V 

is a set of transition rules. 

We write (p, 7, Ai) \ (p', o) for (p, 7, Ai, b, o,p') G TZ. We have a transition (p, s) \ (p', s') 
whenever (p, 7, Ai) (p', o) G 72. and top]^(s) = 7, s G C{Ai), and s' = o(s). 

We know from Broadbent et al. that these tests do not add any extra power to HOPDA. 
Intuitively, we can embed runs of the automata into the stack during runs of the HOPDA. 

Theorem 3.2 (Removing Tests). |21 Theorem 3 (adapted)] For every n-PDA with tests P, we 
can compute an n-PDA P' with C{P) = C{P'). 

Proof. This is a straightforward adaptation of Broadbent et al. [S] . A more general theorem is 
proved in Theorem 16. II □ 

3.2.1 Marking Outputs 

When the HOPDA is in a configuration of the form (p, - i.e. the outermost stack contains 

only a single order-(n — 1) stack - we require the HOPDA to be able to know whether, 

• for a given pi and p 2 , there is a run from (pi, [s]^) to (p 2 , []„) (that is, the HOPDA empties 
the stack), and 

• whether, during the run, an a is output. 

Given P, we first augment P to record whether an a has been produced. This can be done 
simply by recording in the control state whether a has been output. 
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Definition 3.2 {Pa)- Given P = (P, S, F, 7 ?., 7 in) we define 


where 


Pa = ( 7 ^U'Po,S,^, 7 ^U 7 ^a,. 7 ^U Jo,Pin, 7 in) 


Va 

Pa 


Pa 


{Pa \P&'P} 

|(Pa,7) (Pa,0) 

[{pn) ^ (p'a^o) 
{Pa \ p&T} 


b,7) b',o) e 7^| 

{p-,i) ^ {p\o) e 7e| 


u 


It is easy to see that P and Pa accept the same languages, and that Pa is only in a control 
state Pa if an a has been output. 


3.2.2 Building the Automata 

Fix some P = {V, E, F, P, P) and Pa = {Va, E, F, Pa, Pa)- To obtain a HOPDA with tests, we 
need, for each pi,p 2 S V the following automata. Note, we define these automata to accept 
order-(n — 1) stacks since they will be used in an (n — 1)-PDA with tests. 

1. accepting all stacks s such that there is a run of P from (pi, to (p 2 , []„), 

2. accepting all stacks s such that there is a run of P from (pi, to (p 2 , []„) that 

outputs at least one a. 

To do this we will use a reachability result due to Broadbent et al. that appeared in ICALP 
2012 [7]. This result uses an automata representation of sets of configurations. However, these 
automata are slightly different in that they read full configurations “top down”, whereas the 
automata of Theorem 13.21 (Removing Tests) read only stacks “bottom up”. 

It is known that these two representations are effectively equivalent, and that both form an 
effective boolean algebra Ii[7]. In particular, for a top-down automaton A and a control state 
p we can build a bottom-up stack automaton B such that (p, s) G C{A) iff s S P-{B) and vice 
versa. We recall the reachability result. 

Theorem 3.3. Theorem 1 (specialised)] Given an HOPDA P and a top-down automaton 
A, we can construct an automaton A' accepting Pre*p{A). 

Let Ap^j be a top-down automaton accepting configurations of the form (p, where 
topi(s) = 7 . Next, let 

Ap — Apf 

(p',7)-^(p.Pop„)G7i 

and 

^p— U ^Pt.7 

(p',7)-^(P,POPn)G'R- 

I.e. Ap and A“ accept configurations of Pa from which it is possible to perform a pop„ operation 
to p and reach the empty stack. 

Definition 3.3 {Ap^^p^ and Ap^ pfij. Using the preceding notation, given pi and p 2 we define 
bottom-up automata 

• ^Pi.P 2 where C{Ap^^pfi) = {s | (pi,[s]„) e Pre*p{Apfij} . 
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• ^P1.P2 “'^ere = {s \ {pi, [s]„) G PrepjA“J } . 

It is easy to see both Ap-^ and A^^ are regular and representable by bottom-up automata 
since both 

Prep(24pJ and Prep_^ (A“ J 

are regular from Theorem id.lll and bottom-up and top-down automata are effectively equivalent. 
To enforce only stacks of the form [s]^ we intersect with an automaton Ai accepting all stacks 
containing a single order-(n — 1) stack (this is clearly regular). 

3.3 Reduction to Lower Orders 

We are now ready to complete the reduction. Correctness is shown in Section 0] Let Apt be the 
automaton accepting all stacks. In the following definition, a control state (pi,P 2 ) means that 
we are currently in control state pi and are aiming to empty the stack on reaching p 2 , and the 
rules 7?.sim simulate all operations apart from push„ and pop,.^ directly, TZdn detect when the 
run is accepting, 72.push follow the push branch of the tree decomposition, using tests to ensure 
the existence of the pop branch, and T^pop follow the pop branch of the tree decomposition, 
also using tests to check the existence of the push branch. 

Definition 3.4 (P-i). Given an n-PDA P described by the tuple (P, E, P, 7?., {p/} ,pin, 7in) 
as well as families of automata {Ap^^p^)^^ p 2 &'P (^pi P 2 )p p ep define an in — \)-PDA 
with tests 

P-l = (T’-1,S,^,7^_1, J'-I, (Pin,p/) ,7in) 

where 

P-l = {{pi,P2) I Pl,P2 G T"} W {/} 

7^-1 = TZsim U TZfi n C P'push C P'pop 

P-l = {/} 

and we define 

• P-sim is the set containing all rules of the form 

((pi, 7'2),7, -^tt ) iiPl,P2),0) 

for all (pi, 7 ) (p'i,o) G P with o ^ {pMs/i„, pop„} and p 2 G V, and 

• Pfin is the set containing all rules of the form 

{iPl,P2),'l, ■^tt ) if, rew~f) 
for all (pi, 7 ) (p 2 ,pop„) G P, and 

• Ppush is the smallest set of rules containing all rules of the form 

iiPi,P 2 ),'l,Ap^p^) -4 i{p[,p),rew^) 

for all (pi,7) {p[,pushj^) G P and p,p 2 G P, and all rules of the form 

{iPi,P 2 ),l,A^p^) iip[,p),rew^) 
for all (pi, 7 ) A ip'i,push^) G P and p,P 2 G P, and 
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• T^pop is the set containing all rules of the form 

((Pi,P2),7,^Pi.p) iiP,P2),rew^) 

for all (pi, 7 ) A {p[,push^) € TZ and p,p 2 S V and all rules of the form 

((Pi,P2),7,^pi,p) {{p,P2),rew^) 

for all {pi,j) {p[,push^) € TZ and p,p 2 G 'P. 

In the next section, we show the reduction is correct. 

Lemma 3.1 (Correctness of P-i). 

Diagonal^{P) <;=> Diagonalg^{P-i) 

To complete the reduction, we convert the HOPDA with tests into a HOPDA without tests. 

Lemma 3.2 (Reduction to Lower Orders). For every n-PDA P we can construct an {n — 1)- 
PDA P' such that 

Diagonal^{P) Diagonal^{P') . 

Proof. From Definition 13.41 tP_i 1 and Lemma [3.11 (Correctness of P-i), we obtain from P an 
(n — 1)-PDA with tests P_i satisfying the conditions of the lemma. To complete the proof, we 
invoke Theorem 13.21 IRemoving Tests) to find P' as required. □ 


4 Correctness of Reduction 

This section is dedicated to the proof of Lemma [XT] (Correctness of P_i). 

The idea of the proof is that each run of P can be decomposed into a tree: each push„ 
operation creates a node whose left child is the run up to the matching pop„, and whose right 
child is the run after the matching pop„. All other operations create a node with a single child 
which is the successor configuration. 

Each branch of such a tree corresponds to a run of P-i- To prove that P_i can output 
an unbounded number of as we prove that any tree containing m edges outputting a must 
have a branch along which P_i would output log(m) a characters. Thus, if P can output an 
unbounded number of a characters, so can P_i. 


4.1 Tree Decomposition of Runs 

Given a run 

b2 bm 

P — Cq ^ Cl ^ Cm 

of P where each push„ operation has a matching pop„, we can construct a tree representation 
of p inductively. That is, we define Tree(c) = T[e\ for the single-configuration run c, and, when 

b / 
p = c-> p 

where the first rule applied does not contain a push„ operation, we have 

Tree(p) = 5[Tree(p')] 
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and, when 

p = Co Pi -4 P2 

with Cl being the first configuration of p 2 and where the first rule applied in p contains a push„ 
operation, cq = (p, s) and ci = (p', s) for some p,p', s and there is no configuration in pi of the 
form {p",s), then 

Tree(p) = e[Tree(pi),Tree(p 2 )] . 

An accepting run of P has the form p c where p has the property that all push„ operations 
have a matching pop„ and the final transition is a pop„ operation to c = (p, []^) for some p G P. 
Hence, we define the tree decomposition of an accepting run to be 


Tree^p A c^ = £[Tree(p), r[e]] . 


4.2 Scoring Trees 


In the above tree decomposition of runs, the tree branches at each instance of a push„ operation. 
This mimics the behaviour of P-i, which performs such branching non-deterministically. Hence, 
given a run p of P, each branch of Tree(p) corresponds to a run of P-i- 

We formalise this intuition in the following section. In this section, we assign scores to each 
subtree T of Tree(p). These scores correspond directly to the largest number of a characters 
that P-i can output while simulating a branch of T. 

Note, in the following definition, we exploit the fact that only nodes with exactly one child 
may have a label other than e. We also give a general definition applicable to trees with 
out-degree larger than 2. This is needed in the simultaneous unboundedness section. For the 
moment, we only have trees with out-degree at most 2. 

Let 


b = 


0 

1 



and 



m = 0 
m > 0 


Then, 


Score(T) 


'0 

Score(Ti) -b b 


max Score(ri) -b ^ Score(Tj 






T = T[e] 

T = &[Ti] 

T = e[Ti ,..., Tm] 


We then have the following lemma for trees with out-degree 2. 


Lemma 4.1 (Minimum Scores). Given a tree T containing m nodes labelled a, we have 


Score{T) > log(m) 


Proof. The proof is by induction over m. In the base case m = \ and there is a single node p 
in T labelled a. By definition, the subtree T' rooted at p has Score(T') = 1. Since the score of 
a tree is bounded from below by the score of any of its subtrees, we have Score(T) > log(l) as 
required. 

Now, assume to > 1. Find the smallest subtree T' of T containing to nodes labelled a. We 
necessarily have either 

1. T' = a[Ti], or 
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2. T' = e[Ti,T 2 ] where Ti and T 2 each have at least one node each labelled a. 
In case o we have by induction 

Score(T') = 1 + log(m — 1) > log(m) 


In case @ we have 


Score(T') 


= max 


Score(Ti) + Score(T 2 ), \ 
Score(T 2 ) + Score(Ti) ) 


We pick whichever of Ti and T 2 has the most nodes labelled a. This tree has at least I’m/2] 
nodes labelled a. Note, since both trees contain nodes labelled a, the right-hand side of the 
addition is always 1. Hence, we need to show 


log([m/2]) -I- 1 > log(m) 


which follows from 

log(m) - log(rm/ 2 ]) = log(j;^) 

< 

= log(2) = 1. 

By our choice of T' we thus have Score(r) = Score(T') > log(m) as required. □ 

4.3 From Branches to Runs 

Lemma 4.2 (Scores to Runs). Given an accepting run p of P, if Score{Tree{p)) = m then 
e C[P-i). 

Proof. Let pf he the final (accepting) control state of P and let T = Tree(p). We begin at the 
root node of T, which corresponds to the initial configuration of p. Let (p, [s]^) be this initial 
configuration and let ((p,p/), s) be the initial configuration of P-i- 

Thus, assume we have a node rj of T, with a corresponding configuration c = (p, s) of P 
and configuration c_i = ((p,Ppop), top„(s)) of P_i and a run p_i of P_i ending in c_i and 
outputting (m — Score(T')) a characters where T' is the subtree of T rooted at p. The subtree 
T' corresponds to a sub-run p' of p where the transition immediately following p' is a pop„ 
transition to a control state Ppop- 

There are two cases when we are dealing with internal nodes. 

• T' = b[Ti]. 

In this case there is a transition c \ c' via a rule (p, 7 ) \ (p', o) where o ^ {push„, pop„}. 
Hence, we have the rule ((p,ppop), 7, Att) A- ((p/ppop),o) in P-i and thus we can ex¬ 
tend p-i with a transition c_i A c'_i via this rule where p-i, c' and c'_i maintain the 
assumptions above. 

• r = e[Ti,T2]. 

In this case we have that T' corresponds to a sub-run 

c -4 Pi A P 2 
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of p. The transition from c to the beginning of pi is via a rule ri = {p, 7) (pi, push„) and 
the transition from the end of pi to the start of p 2 is via a rule r 2 = {p 2 ,Ji) ^ iP 3 , pop„). 
Moreover, from the definition of the decomposition, the final configuration in p 2 is followed 
in p by a pop rule rs = (^ 4 , 72 ) ^ (Ppop, pop„). 

There are two further cases depending on whether the score of T' is derived from the 
score of Ti or T 2 . 


— In the case of Ti, then, first observe that p 2 followed by an application of ra is a run 
from {p 3 , s) to (ppop, pop„(s)) where the stack pop„(s) does not appear in p 2 - Thus, 
there is a run of P from (pa, [top„(s)]^) to (ppop, []„) and moreover, this run outputs 
an a whenever the original run does. Hence, there is also a corresponding run of P 
from which outputs an a whenever the original run does. 

If an a is output, we have c_i G and Score(r') — Score(Ti) = 1. We can 

extend p via an application of the rule ^(p,Ppop)) 7 , ^pa.ppop) ((PiiPa): rew.^) that 

exists in P_i since c_i € This transition maintains the property on the 

stacks since the push„ copies the topmost stack, hence P_i does not need to change 
its stack. It maintains the property on the scores since it outputs a, accounting for 
the part of the score contributed by T 2 . Finally, the condition on control states is 
satisfied since the second component is set to p 2 - 

If an a is not output, then the case is similar to the above, except T 2 does not 
contribute to the score, we have c_i G ^(^ps.Ppop)? the transition of P_i is 
labelled £ instead of a. 

— The case of T 2 is almost symmetric to Ti. Observe that pi followed by an application 
of r 2 is a run from (pi, push„(s)) to (pa, s) where the stack s does not appear in pi. 
Thus, there is a run of P from (pi, [top„(s)]^) to (pa, []^) and moreover, this run 
outputs an a whenever the original run does. Hence, there is also a corresponding 
run of P from which outputs an a whenever the original run does. 

If an a is output, we have c_i G £(£1“^ and Score(T') — Score(r 2 ) = 1. We can 
extend p via an application of the rule ((p,Ppop )5 7 , ((PSiPpop), rew.y) that 

exists in P_i since c_i G This transition maintains the property on the 

stacks since the stack after the pop„ is identical to the stack before the push„, hence 
P_i does not need to change its stack. It maintains the property on the scores since 
it outputs a, accounting for the part of the score contributed by Ti. Finally, the 
condition on control states is satisfied since the second component is unchanged. 

If an a is not output, then the case is similar to the above, except Ti does not 
contribute to the score, we have c_i G C{Ap^^p^) and the transition of P_i is labelled 
£ instead of a. 

Finally, we reach a leaf node p with a run outputting the required number of as. We need 
to show that the run constructed is accepting. Let p' be the first ancestor of p that contains p 
in its leftmost subtree. Let T' be the subtree rooted at p'. This tree corresponds to a sub-run 
p' of p that is followed immediately by a pop„ rule (p, 7 ) (Ppop,pop„). Moreover, we have 
((p,Ppop), 7 , Tltt) (/, rew.y) with which we can complete the run of P-i as required. □ 
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4.4 The Other Direction 

Finally, we need to show that each accepting run of P_i gives rise to an accepting run of P 
containing at least as many as. 

Lemma 4.3 {P-i to P). We have Diagonalg^{P-i) implies Diagonalg^{P). 

Proof. Let pf be the unique accepting conrol state of P. Take an accepting run p_i of P_i. 
We show that there exists a corresponding run p of P outputting at least as many as. 

Let 

b b e / p \ 

Co —>•••—> Cm —>■(/, s) 

for some s be the accepting run of P_i. We define inductively for each 0 < i < m a pair of runs 
p\ ,p\ of P such that 

1 . P 2 ends in a configuration {pf, []^) (i.e. is accepting), and 

2. if a = {ip,Ppop),s) then 

(a) the final configuration of p\ is {p, [ssi... «;]„), for some si,..., s;, and 

(b) the first configuration of P 2 is (Ppop, [si... and 

3. the sum of the number of a characters output by p\ and p| is at least the number of a 

characters output by cq Ci. 

Initially we have cq = ((pi„,p/), s) and s = | 7 inl„_i- We define p? = {pin, [s]„) and p^ = 
{pf, []^) which immediately satisfy the required conditions. 

Assume we have p\ and p| as required. We show how to obtain pf^^ and p^'*’^. There are sev¬ 
eral cases depending on the rule used on the transition Ci > Cj+i. Let Cj = {{p,Ppop), s), the 
final configuration of p\ be {p, [ssi... Si]„) and the first configuration of p| be (ppop, [si • ■ • 5i]„). 

• If the rule was ((p,ppop), 7, Att) A- ((p',Ppop), o) with o ^ push„,pop„ then we have 

{p, 7 ) \ (p', o) GTZ and we define pf^^ to be p\ extended by an application of this rule. 
We also define = p|. 

The required conditions are inherited from p\ and P 2 since o only changes the top„ stack, 
the final configuration of P 2 ^^ is the same as p|, Ppop is not changed, and the rule of P 
outputs an a iff the rule of P_i does. 

• If the rule was ((p,Ppop), 7 , -4 ((p',pJ,op),rew.y) then we have a rule r = 

(P)7) (y^pash^) € TZ. Moreover, from the test we know there is a run of P 

from (ppop, [s]„) to (ppop, []„) and hence there is also a run p from (Pp^p, [ssi... «;]„) to 
(Ppop, [si • • • Siln)- ~ PP 2 and pf^^ to be p\ extended by an application of r. 

Since the final configuration of is {p', [sssi... Si]„) it is easy to check the required 
correspondence with the first configuration (Pp^p, [ssi... of 

The remaining conditions are immediate since no a is output and the final configuration 
of P 2 ^^ is the same as P 2 . 

• The case of ^(p,Ppop), 7 , A“, pp^p) ((^'^^'pop) A^w.^) is almost identical to the previ¬ 
ous case. To adapt the proof, one needs only observe that since Ci € -C^Ap, pppp) the 
run p used to extend p| also outputs at least one a character. 
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£ 


Oi 


02 


Oi 


02 


£ £ 

Figure 3: An example showing that following a single branch does not work for simultaneous 
unboundedness. 


• If the rule was ((p,Ppop), 7 , ((p2,Ppop),rew-^) then there is also a rule r = 

(Pj 7 ) ^ (PijPush„) G TZ and from the test we know there is a run of P from 

(pi, [s]^) to (p 2 , Ora) therefore there is also a run p that goes from (pi, [sssi... s;]^) 
to (p 2 , [ssi... sz]„)- We set to be p\ extended with an application of r and then the 
run p. We also set = p\. 

To verify that the properties hold, we observe that c^+i = {{p 2 ,Ppop), s), and ends 
with (p 2 , [ssi... Si]„) and P 2 ^^ still begins with (ppop, [si... 5 ;]^^) and has the required 
final configuration. The property on the number of as holds since the rule of P-i did not 
output an a. 

• The case of ((p,Ppop), 7, p^) A- ((p 2 ,Ppop), rew-^) is almost identical to the previous 

case. To adapt the proof, one needs only observe that since Ci G £(Ap^ p^) the run p used 
to extend p\ also outputs at least one a character. 

Finally, when we reach i = m we have from the final transition of the run of P-i that there is 
a rule (p, 7 ) A (ppop,pop„). We combine p™ and p™ with this pop transition, resulting in an 
accepting run of P that outputs at least as many a characters as the run of P-i. □ 

5 Multiple Characters 

We generalise the previous result to the full diagonal problem. Naively, the previous approach 
cannot work. Consider the HOPDA executing 

pushT*; push„; pop™; pop„; pop™ 

where the first sequence of pop^ operations output ai and the second sequence output 02 . 

The corresponding run trees are of the form given in Figure [3] In particular, P_i can only 
choose one branch, hence all runs of P_i produce a bounded number of ois or a bounded 
number of 028 . They cannot be simultaneously unbounded. 

For P_i to be able to output both an unbounded number of oi and 02 characters, it must be 
able to output two branches of the tree. To this end, we define a notion of a-branch HOPDA, 
which output trees with up to a branches. We then show that the reduction from n-PDA to 
(n — 1)-PDA can be generalised to a-branch HOPDA. 
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5.1 Branching HOPDA 

We define n-PDA outputting trees with at most a branches, denoted (n, q;)-PDA. Note, an 
n-PDA that outputs a word is an (n, 1)-PDA. Indeed, any (n, a)-PDA is also an (n, q;')-PDA 
whenever a < a'. 

Definition 5.1 ((n, q;)-PDA). We define an order-n a-branch pushdown automaton ((n, a)- 
PDA) to be given by a tuple P = (P, S, P, 7?., Pin, 7 in) 6 *) where V, S, P, T, pin, and 7 in are 
as in HOPDA. The set of rules TZ C [J PxPxEx Ops„ x together with a mapping 

l<m<Q 

0 : P —> { 1 ,..., a} such that for all (p, 7 , b, o,pi,... ,Pm) &TZ we have 6{p) > 6 *(pi) + - • •+9{pm)- 

We use the notation (p, 7 ) A- (pi,... ,pm, o) to denote a rule (p, 7 , 6 , o,pi,... ,pm) S TZ. 
Intuitively, such a rule generates a node of a tree with m children. The purpose of the mapping 
9 is to bound the number of branches that this tree may have. Hence, at each branching rule, 
the quota of branches is split between the different subtrees. The existence of such a mapping 
implies this information is implicit in the control states and an (n, a)-PDA can only output 
trees with at most a branches. 

From the initial configuration cq = (pinj [ 7 inl„) a run of an (n, a)-PDA is a tree T = {D, A) 
whose nodes are labelled with n-PDA configurations, and generates an output tree T' = {D, A') 
whose nodes are labelled with symbols from the output alphabet. Precisely 

• A(e) = Co, and 

• for a node ry with children 771 ,..., pm and A(? 7 ) = (p, s) there is a rule (p, 7 ) \ (pi,..., pm, o) 
such that for all 1 < i < m we have A(r/i) = (pi, s') where top]^(s) = 7 , s' = o(s). Moreover 
we have A'(p) = b. 

• For all leaf nodes p we have A'(p) = e. 

The run is accepting if for all leaf nodes p we have A(p) = (p, []„) and p G P. Let £-{P) be the 
set of output trees of P. 

Given an output tree T we write \T\a to denote the number of nodes labelled a in T. For 
an (n, a)-PDA P, we define 


Diagonal„^_ = 


Vm.3T e £(P).VI <i< a.\T\ai > m . 


6 Reduction For Simultaneous Unboundedness 

Given an (n, q;)-PDA P we construct an (n — l,a)-PDA P_i such that 

Diagonal„^_ Diagonal,,^^ (P_i) . 

Moreover, we show Diagonal^,^ (P) is decidable for a (0,a)-PDA (i.e. a regular automaton 
outputting an a-branch tree) P. 

For simplicity, we assume for all rules (p, 7 ) A (pi,... ,Pm,o) if m > 1 then o = rew.^ (i.e. 
the stack is unchanged). Additionally we have b = s. 

We also make analogous assumptions to the single character case. That is, we assume 
S = {oi,..., Oq, e} and use b to range over S. Moreover, all rules of the form (p, 7 ) \ (p', o) 
with o = push„ or o = pop„ have b = e. Finally, we assume acceptance is by reaching a unique 
control state in T with an empty stack. 
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6.1 Some Intuition 

We briefly sketch the intuition behind the algorithm. We illustrate the reduction from (n, a)- 
PDA to (n — 1, a)-PDA in Figured! 

• We begin with an n-PDA which we first interpret as an (n, Q!)-PDA. This is possible 
because an (n,a)-PDA can produce at most a branches. Thus, an n-PDA — which 
produces a single branch — is also a (n, q;)-PDA. We work with HOPDA producing a 
branches because, after each reduction step, we will need to output one branch for each 
character in oi,..., Oq. 

• We have an (n, a)-PDA P that outputs a tree with at most a branches. In Figure dj 
we show part of a run tree with 2 branches. The push„ and pop„ operations are shown 
on the edges of the tree. Nodes are numbered to help identify them during the different 
transformations. 

• We “decompose” this tree into another tree where the branches appearing after the pop„ 
operations are hung from the same parent as their matching push„. This is shown in 
the middle of Figured! Notice that this tree has an unbounded number of branches (it 
branches at each push„). However, we know that the maximum out-degree of any of its 
nodes is (a + 1) since the source of a push„-labelled edge has one child, and we add at 
most a extra children corresponding to the pop„ on each of its at most a branches. 

• We prove a generalisation of Lemma 14.11 (Minimum Scores) that shows a run tree with 
at least m instances of a character a has a branch with a score of at least log(,^_,_]^)(m). 
Thus, we need to select one branch for each a we wish to output. 

• We build an (n — l,a)-PDA P_i that non-deterministically picks out the highest scoring 
branches for each a. This is shown on the right of Figured! 


6.2 Branching HOPDA with Regular Tests 

As before, we instrument our HOPDA with tests. Removing these tests requires a simple 
adaptation of Broadbent et al. [5] . 

Definition 6.1 ((n, q;)-PDA with Tests). Given a sequence of automata Ai,..., Am, an (n, a)- 
PDA with tests is given by a tuple P — (P, S, F, 7?., Pm, 7 in) 6 *) where V, S, F, P, pm, yin are 
as in HOPDA. The set of rules 7?. C IJ 7^ x F x {Ai,..., Am} x S x Ops„ x 7^™ together 

l<m<Q: 

with a mapping 9 : V ^ {!,...,a} such that for all (p, 7 , A, 6 , o,pi,... ,prn) € TZ we have 
9{p) > 9{pi) H-h 9{pm)- 

We use the notation (p, 7 , A) \ (pi,... ,Pm, o) to denote a rule (p, 7 , A, 6 , o,pi,... ,Pm) S 7Z. 
From the initial configuration cq = (pm, l 7 inl„) a run of an (n, q;)-PDA with tests is a tree 
T = {D, A) and generates an output tree p = {D, A') where 

• A(e) = Co, and 

• for a node rj with children r]i,...,r]m and X{ri) = {p,s) there is a rule (p, 7 ,A) \ 
(pi,... ,Pm, o) such that s G D{A) and for all 1 < * < m we have \{rji) = {pi, s') where 
topi(s) = 7 , and s' = o(s). Moreover we have X'{ri) = b. 
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(a) An o-branch run tree of (b) The decomposition of the run tree (c) An a-branch 

P subtree of |[4bll 


Figure 4: Illustrating the reduction steps. 


• For all leaf nodes rj we have A'(ry) = £. 

The run is accepting if for all leaf nodes rj we have X{r]) = {p, []^) and p € T. Let C{P) be the 
set of output trees of P. 

Theorem 6.1 (Removing Tests), Theorem 3 (adapted)] For every {n,a)-PDA with tests 
P, we can compute an {n,a)-PDA P' with C{P) = C{P')- 

Proof. This is a straightforward adaptation of Broadbent et al. [5]. Let the (n, a)-PDA with 
tests be P = (P, S, F, TZ, P,pin, 7in, d) with test automata Ai,..., Am- We build an (n, a)-PDA 
that mimics P almost directly. The only difference is that each character 7 appearing in the 
stack is replaced by 

For each test A we have a vector of functions 

The function Tk : Q ^ Q intuitively describes runs of A from the bottom of topfc_|_]^(s) to the 
top of pop;.(top;.^]^(s)). Thus, we can reconstruct an entire run over pop]^(s) from initial state 
q as 

g' = Ti(---T„(g)) 

and then we can consult A to complete the run by adding the effect of reading top]^(s). 

Thus, let Aj = (Qj,r[],g?„, Aj, Q^p). We define 

P= (v,T,,t,'}Z,P,Pin,iin,d^ 
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where 

r = {(7.P1. ■ • ■ ,^m) I 7 G r A Vi.Tj € (Q^ Q*)”} 

and iZ is the smallest set of rules of the form 

(p.7) (pi,...,Pi,Update(o,7)) 

where 7 = (7, ri,..., r^) and {p,j,Ai) \ {pi,... ,pi,o) G TZ and Accepts(7, r*, A*, 
and we define 

Accepts(7,Ti,... ,r„, A,(7i„, Qp) 
q = ri(- • •T„(( 7 in)) A A{q, [„• • • [17) e Qf 

where A(<7, [„• • • [17) is shorthand for the repeated application of A on 7 then [1, back to 
and we define Update(o, 7) = d following the cases below. Let 7 = (7, ti, ... ,Tm)- 

• When o = rewg- then 0 = (ct, ti, ...,Tm). 

• When o = push^ then d = push^; rew^^ n where for all i we have 

Ti — ('^1 j ■ ■ ■ jTk — ljTf., Tfc_|_i , . . .Tji) 


and 

Tkiq) = Tfc(A,(Ti(- • •Tfc(( 7 )),]fc_i[fc_r •• [17)). 

I.e., we apply the functions to read the whole stack once, and then the correct part of the 
copy created by the push^. 

• When o = pop^ then 


o = popo; (cr, t[,..., rew^_^^ 
where for all i we have Tj = (ti, ..., t„) and r' = (r{,..., r^) and 

A = (A) • ■ • ) Pfc-l) Tk, ■ ■ ■ Tji) ■ 

We can see that this is correct since we do not update the functions that read parts of 
the stack unchanged (i.e., stacks outside of those changed by the pop^), and we take the 
functions that are correct for the newly exposed top parts of the stack for the remaining 
functions. 

Finally, we set 7in = ( 7 in,Pi) ■ • ■ ,Tm) where for each i we have Ti = (ti, ... ,Tn) such that for 
each k we have Tk{q) — A{q,]k ■ • •]«). □ 

6.3 Building The Automata 

Previously we built automata to indicate that from pi, the current top stack could be 

removed, arriving at p 2 - This is fine for words, however, we now have a-branch trees. It 
is no longer enough to specify a single control state: the top stack may be popped once on 
each branch of the tree, hence for a control state p we need to recognise configurations with 
control state p from which there is a run tree where the leaves of the trees are labelled with 
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configurations with control states pi, ■ ■ ■ ,Pm and empty stacks. Moreover we need to recognise 
the set O of characters output by the run tree. More precisely, for these automata we write 


where 9{p) > 0{pi) + • • • + 9{pjn) and O C {oi,..., Ua}- We have s £ iff there 

is a run tree T with the root labelled (p, [s]^) and m leaf nodes labelled {pi, []„),..., (pm, []„) 
respectively. Moreover, we have a G O iff the corresponding output tree T' has \T'\a > 0. 

6.3.1 Alternating HOPDA 

To construct the required stack automata, we need to do reachability analysis of (n, a)-PDA. 
We show that such analyses can be rephrased in terms of alternating higher-order pushdown 
systems (HOPDS), for which the required algorithms are already known [7]. Note, we refer to 
these machines as “systems” rather than “automata” because they do not output a language. 

Definition 6.2 (Alternating HOPDS). An alternating order-n pushdown system is a tuple 
P = {V,T,TZ) where V is a finite set of control states, T is a finite stack alphabet, and 

nc(VxTx Ops^ X P) U (P X r X 2^) 


is a set of transition rules. 


We write (p, 7 ) {Pjo) to denote (p, 7 , o,p) G TZ and (p, 7 ) —J- pi,...,Pm to denote 

An run of an alternating HOPDS may split into several configurations, each of which must 
reach a target state. Hence, the branching of the alternating HOPDS mimics the branching of 
the (n,a)-PDA. Given a set C of configurations, we define Prep(C') to be the smallest set C 
such that 


C 


C U 

I {P^ s) 

i {p, s) 


(p, 7 ) (p',o) G P A 'I 

topi(s)=7A > 

{p',o{s))£C' ] 

(p,!) ^>Pi,...,Pm G P A 
topi(s) = 7 A 
Vi.(pi,s) G C" 


U 



6.3.2 Constructing the Tests 

fn order to use standard results to obtain A^ p^^ we construct an alternating HOPDS P* 
and automaton A such that checking c G Prep^(A) for a suitably constructed c allows us to 
check whether s G p^). 

The alternating HOPDS P* will mimic the branching of P with alternating transitions^ 
(P) 7) Pi) • ■ • )Pm of Po. It will maintain in its control states information about which char¬ 
acters have been output, as well as which control states should appear on the leaves of the 
branches. This final piece of information prevents all copies of the alternating HOPDS from 
verifying the same branch of P. 

^We slightly alter the alternation rule from ICALP 2012 [7] by matching the top stack character as well as 
the control state. This is a benign alteration since it one can track the top of stack character in the control 
state. 
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Definition 6.3 (P<>). Given an {n,a)-PDA P described by the tuple (P, P',pin,7in); of 

P, we define 

Ps = (Po, r, Po) 


where 


1 



I < m < a A '1 

Po = 

^{p,0,pi,.. 

■ ■ ) Pm) 

0 C {ai,.. .,aa} A > 
Pl,...,Pm&V ] 


and Po is the set of rules containing, for each 


ip,i) (p',o) € n 


all rules 

{{pi,0\{b},pi,...,pi),o) 

and for each 

{p,l) ^ {Pi, ■ • ■ ,Pm, rew^) e n 

with m > 1 all alternating rules 

{{P,0,p[,... ,p'f),j) {pi,Oi,pI,...,pIJ ,...{pjn,Om,pT,---,PZ.) 

where p'l,... ,pi is a permutation of pi,... ,pj_^,.. .p'f',... and O = Oi U • • • U Om- 

In the above definition, the permutation condition ensures that the target control states 
are properly distributed amongst the newly created branches. 

Lemma 6.1. We have s G iff 

{{p,0,pi,...,pm),[s]n) € Pre*pJA) 

where A is such that 


= {{{P,^,P),^n) \ P^{Pi,---,Pm}} ■ 


Proof. First take s G tC.{App,^ p^) and the run tree witnessing this membership. We can 
move down the tree, maintaining a frontier ci,..., c; and building a tree witnessing that 
((p, 0,pi,... ,Pm), [s]„) G Prep^(Gl). Initially we have the frontier (p, [s]^) and the initial con¬ 
figuration ((p, 0,pi,... ,pm), [s]„). 


Hence, take a configuration c = fp', s') from the frontier and corresponding configuration 
c' = {(p',0',p'i,... ,p'fj,s'). If the rule applied to c is not a branching rule, we simply take 
the matching rule of Po and apply it to c'. Note, that if the rule output b we remove b from 
O'. Hence, O' contains only characters that have not been output on the path from the initial 
configuration. 

If the rule applied is branching, that is (p', 7 ) (p",... ,p", rew.^) then we apply the rule 

((p',0,pi,... ,p'),7) -5- {p'i,Oi,p\,... ,p\) ,... (^p'j,Oj,pl,... ,pij 


where p '^,... ,p' is a permutation of p},... ,p^^,.. .p{,... ,p™ and O = Oi U • • • U Om- These 
partitions are made in accordance with the distribution of the leaves and outputs of the run 
tree of P. I.e. if a control state p" appears on the i'th subtree, then it should appear in the i'th 
target state of Po- Similarly, if the i'th subtree outputs an & G O, then b should be placed in 
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Oi'. Applying this alternating transition creates a matching configuration for each new branch 
in the frontier. 

We continue in this way until we reach the leaf nodes of the frontier. Each leaf (p', s) has a 
matching ((p', 0,p'), s) and hence is in C{A). Thus, we have witnessed ((p, 0,pi,... ,Pm), [s]„) G 
Prep^(A) as required. 

To prove the other direction, we mirror the previous argument, showing that the witnessing 
tree for can be used to build a run tree of P. 

□ 


It is known that Prep (A) is computable for alternating HOPDS. 

Theorem 6.2. ^ Theorem 1 (specialised)] Given an alternating HOPDS P and a top-down 
automaton A, we can construct an automaton A' accepting Pre*p{A). 

Hence, we can now build from the control state p and top-down automaton 

representation of Prep^ (A) since we can effectively translate from top-down to bottom-up stack 
automata. 

6.4 Reduction to Lower Orders 

We generalise our reduction to (n, a)-PDA. Let Att be the automata accepting all configu¬ 
rations. Note, in the following definition we allow all transitions (including branching) to be 
labelled by sets of output characters. To maintain our assumed normal form we have to re¬ 
place these transitions using intermediate control states to ensure all branching transitions are 
labelled by e and all transitions labelled O are replaced by a sequence of transitions outputting 
a single instance of each character in O. 

The construction follows the intuition of the single character case, but with a lot more 
bookkeeping. Given an (n, a)-PDA P we define an (n — 1, a)-PDA with tests P_i such that 
P satisfies the diagonal problem iff P_i also satisfies the diagonal problem. The main control 
states of P-i take the form 

{p,Pi,...,Pm,0,B) 

where p,pi,... ,pm are control states of P and both O and B are sets of output characters. We 
explain the purpose of each of these components. 

We will define P_i to generate up to m branches of the tree decomposition of a run of P. In 
particular, for each of the characters a G {ai^... ,aa} there will be a branch of the run of P_i 
responsible for outputting “enough” of the character a to satisfy the diagonal problem. Note 
that two characters a and a' may share the same branch. When a control state of the above 
form appears on a node of the run tree, the final component B makes explicit which characters 
the subtree rooted at that node is responsible for generating in large numbers. Thus, the initial 
control state will have B = {oi,..., Oa} since all characters must be generated from this node. 
However, when the output tree branches - i.e. a node has more than one child - the contents of 
B will be partitioned amongst the children. That is, the responsibility of the parent to output 
enough of the characters in B is divided amongst its children. 

The remaining components play the role of a test A^^^^ That is, the current node is 

simulating the control state p of P, and is required to produce m branches, where the stack is 
emptied on each leaf and the control states appearing on these leaves are pi, ■ ■ ■ ,Pm- Moreover, 
the tree should output at least one of each character in O. 

Note, P_i also has (external) tests of the form A^p^_ that it can use to make decisions, 
just like in the single character case. However, it also performs tests “online” in its control 
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states. This is necessary because the tests were used to check what could have happened on 
branches not followed by P-i. In the single character case, there was only one branch, hence 
P-i would uses tests to check all the branches not followed, and then continue down a single 
branch of the tree. In the multi-character case the situation is different. Suppose a subtree 
rooted at a given node was responsible for outputting enough of both oi and 02 . Amongst 
the possible children of this node we may select two children: one for outputting enough oi 
characters, and one for outputting enough 02 characters. The alternatives not taken will be 
checked using tests as before. However, the child responsible for outputting m may have also 
wanted to run a test on the child responsible for outputting 02 . Thus, as well as having to 
output enough 02 characters, this latter child will also have to run the test required by the 
former. Thus, we have to build these tests into the control state. As a sanity condition we 
enforce O fl H = 0 since a branch outputting a should never ask itself if it is able to produce at 
least one a. 

We explain the rules of P_i intuitively. It will be beneficial to refer to the formal definition 
(below) while reading the explanations. The case for T^push is illustrated in Figure [S] since it 
covers most of the situations appearing in the other rules as well. 

• The rules in T^init guess how many branches will be needed to output enough of each a. 
(This might be less than a since one branch might account for several characters.) 

• The rules in TZfin check whether the run can be finished (always via a pop„ since we are 
aiming for the empty stack). This is true if we only have one branch to complete (just 
reach p') and we have no more characters that we’re obliged to output. 

• The rules in T^sim simulate a non-branching operation. They do this faithfully, simply 
passing along all information (updating O if a character is output by the simulated 
transition). 

• The rules in are the first of the complicated rules. This is mainly a matter of notation. 

The reasoning behind the rules is that we’re at a point where the tree splits into I different 
branches. These have control states p[,... ,p'i respectively. We non-deterministically guess 
which of these branches should output which of the characters in B. Thus, we split B into 
Bi,..., Bi. This means we are exploring i branches. Let cci,..., cci be the control states on 
these branches. The remaining branches we handle using tests on the stack. Let j/i,..., 
be the control states appearing on these branches. We require that all of are 

accounted for, so we assert that p{,... ,p'i is a permutation of a;i,..., cci, yi,..., y^. 

Similarly, in the current subtree we are obliged to pop to leaf nodes containing the control 
states pi,... ,pm- We split these obligations between the branches we are exploring and 
those we are handling using tests. We use another permutation check to ensure the 
obligations have been distributed properly. 

Finally, we are required to output characters in O. We may also, in choosing a particular 
branch for a character o, need to output a to account for instances appearing on a missed 
branch. Hence we also output O' to account for these. We distribute the obligations O 
and O' amongst the different branches using Ai,..., and Yi,..., Yj. 

• The rules in 7?.push and 7?.pop follow the same intuition as in the single character case, 
except we have the branching to deal with. In particular, at a push we have one branch 
corresponding to exploring what happens between the push and the corresponding pops, 
and a branch for each of the corresponding pops. We choose a selection of these branches 
to track with the HOPDA and a selection to handle using tests. The difference between 
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P 


\ push„ 
P' 



Pl,...,Pm 


Figure 5: Illustrating the rules in 72.push- 


'^push and T^pop is that the former explores the branch of the push using the HOPDA 
and the latter uses a test. 

In these rules, after the push we’re in control state p' and we guess that we will pop to 
control states p[,... ,p[. Hence we have a branch or a test to ensure that this happens. 
The remaining branches and tests are for what happens after the pops. The start from 
the states Pi,...,p'i and must, in total, pop to the original pop obligation pi,... ,Pm- 
Hence, we distribute these tasks in the same way as the 7?.br- 

Before giving the formal definition, we summarise the discussion above by recalling the 
meaning of the various components. A control state (p,pi,... ,Pm, O, B) means we’re currently 
simulating a node at control state p that is required to produce m branches terminating in 
control states pi,... ,Pm respectively, that the produced tree should output at least one of each 
character in O and the entire subtree should output enough of each character in B to satisfy the 
diagonal problem. In the definition below, the set O' is the set of new single character output 
obligations produced when the automaton decides which branches to follow faithfully and which 
to test (for the output of at least one of each character). The sets Xi,... ,Xi and Yi,..., Yj- 
represent the partitioning of the single character output obligations amongst the tests and new 
branches. 

The correctness of the reduction is stated after the definition. A discussion of the proof 
appears in Section 0 

Definition 6.4 (P-i). Given an {n,a)-PDA P described by (P, S, F, 7^, {py} ,pin, 7 in, 6*) and 
automata for all 1 < m < a, p,pi,... ,Pm G P, and O C {oi,..., Oa} we define an 
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(n — \,a)-PDA with tests 




where V-i is the set 


{PinJ} 


1 < m < a A 
p,pi,... ,pm & V A 
0 ,B C {ai,...,aa} A 
OnB = $ 


> W 


and 


TZ-I = TZinitATZsim^Pbr^B-finATZpush^'B-pop 

P-1 = {/} 

and 0 -i{{p,pi, ... ,Pm, O, B)) = \B\ and is 1 for all other eontrol states. We define the sets of 
rules, where in all cases, pi,... ,pm G V and O, O', B C {ai,..., Ua}, to he as follows: 

• B-init is the set containing all rules of the form 

(pinnin) iiPin,Pf, . . . , P/ , 0, {fll , . . . , Oa }) , reW^.J 
where \pf,...,pf\ < a, and 

• Bfin is the set eontaining all rules of the form 

((p,p', 0,B),7, Att) 4 if,rew^) 
for all (p, 7) {p',pop^) G TZ and B C {oi,... ,aa}, and 

• Bsim is the set containing all rules of the form 

Up,P i, ■ ■ ■ ,Pm, o, B), 7, Att) Up',P i, ...,Pm,o\ {&}, b), o) 

for (p, 7) A- (p',o) G TZ, and o ^ {push„,pop„}, and 

• TZbr is the set containing all rules of the form 

( 


iP,Pi,---,P-m,0,B),j, 


A 




O'DB / 
-^ 

^ {xi,x\,.. 

■,x]^,Xi,Bi) 

. . . , 

\ 

^ {xi,x\,.. 

.,xi.,Xi,Bf) 


, I Z^LV-y 


V 

where 

(P, 7 ) ^ (Pi, ■■■,Pi, rewj) G TZ 
and p'l,.. .p'l is a permutation of 


X \,. . . , Xj, ^1, . . . , 

and pi^... jPm is a permutation of 

x).yl, 

and 

O U O' = U • • • U X* U Yi U • • • U Y,- 

and B = Bi L) ■ ■ ■ U Bi. 
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• T^push is the set containing all rules of the form 


, rewj 

where 

(pn) {p',pushj 

and p[,.. .p'l is a permutation of 


( 




(p,Pi,...,Pm,0,B),j, 


V 


A 


o'nB 
- 5 - 


y 3 ,vi,---,yi- ) 


..., 

y (^Xi , • 5 , X'l , Bj^'j 


Xi , . . . , Xj, yi,..., 

and pi,... ^Pm 'i’S a permutation of 

xl,... ,x]^,... x\,... ,x].yl,... .. .y{,... ,yl. 

and 

o u O' = X u Xi u • ■ • u Xi u n u • • ■ u y,- 

and B = Bp U • • • U Si. 


• we have TZpop is the set containing all rules of the form 


/ 

ip,Pi, ■ ■ ■ ,Pm,0,B),J, 

V 


n- • -n 

yj,y[,...,y^_ 


\ 


O'nB 
->■ 


{xi,x\, 

{xi,x\ 


where 

{P,l) {p',push^) 

and p'l,.. .p[ is a permutation of 


,x].,Xi,Bi) 



xi,..., Xi, 2/i, ■ • ■, 

and pi,. -. ^Pm ^5 o, permutation of 

x\,...,x]^,...x\,..., xt.yl,... ,y]^,.. .y{,... ,yl. 

and 

o U O' = y U Xi U • • • U X, U Fi U • • • U Fj 

and B = BiVJ ■ ■ ■ \J Bi. 

In Section [7] we show that the reduction is correct. 

Lemma 6.2 (Correctness of P_i). 

Diagonal^^^ Diagonal^^^ 

To complete the reduction, we convert the (n, a)-PDA with tests into a (n, a)-PDA without 
tests. 
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Lemma 6.3 (Reduction to Lower Orders). For every (n,a)-PDA P we can build an order- 
(n — 1) a-branch HOPDA P' such that 

Diagonal^^^ ,^^{P) Diagonal^^^ ,^^{P') . 

Proof. From Definition 16.41 (P_i i and Lemma [6.21 (Correctness of P-i), we obtain from P an 
(n — l,a)-PDA with tests P-i satisfying the conditions of the lemma. To complete the proof, 
we invoke Theorem [6T] (Removing Tests) to find P' as required. □ 

We show correctness of the reduction in Section [T] First we show that we have decidability 
once we have reduced to order-0. 


6.5 Decidability at Order-0 

We show that the problem becomes decidable for a 0-PDA P. This is essentially a finite state 
machine and we can linearise the trees generated by saving the list of states that have been 
branched to in the control state. After one branch has completed, we run the next in the list, 
until all branches have completed. Hence, a tree of P becomes a run of the linearised 0-PDA, 
and vice-versa. Since each output tree has a bounded number of branches, the list length 
is bounded. Thus, we convert P into a finite state word automaton, for which the diagonal 
problem is decidable. Note, this result can also be obtained from the decidability of the 
diagonal problem for pushdown automata. 


Definition 6.5 (P). Given an {0,a)-PDA P described by the tuple (P, S, F, P, P,pin, yin, 0) 
we define a 0-PDA 

P= (P,i;,r,P,P,Pin,7in) 


such that 


T’ = |(P,Pl,7U • • -Pm^lm) 

and TZ is the set containing all rules of the form 


p,pi,... ,pm e P A 

er A }u{f} 
0 < m < a 


((P,Pl,7U ■ • ■ ,Pm,7m), 7) A p[ 


/ Pli 7li • ■ • iPmi 7mj 


P 2 ,a,...,Pi,cr 


, reWa 


for each 

(p, 7 ) \ ,pi, rewa) e P 

and all rules 

■ ,Pm,7m),7) A ((pi,P2,72, ■ ■ ■ ,Pm,'lni), reWj,) 

whenever p € iF. 

Lemma 6.4 (Decidability at Order-0). We have 

Diagonal^^^ ,^JP) Diagonal^^^ ,^^(P) 

and hence Diagonal^^^ (P) is decidable. 
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Proof. Take an accepting run tree p of P. If this tree contains no branching, then it is straight¬ 
forward to construct an accepting run of P. Hence, assume all trees with fewer than a branches 
have a corresponding run of P. At a subtree c[Ti,..., Tm] we take the run trees pi,..., pm 
corresponding to the subtrees. Let c = (p, 7 ) and Ci = (pi, 7 ),..., Cm = {pm,^) be the con¬ 
figurations at the roots of the subtrees. We build a run beginning at c and transitioning to 
{{pi,P 2 , 7, ■ • ■ ,Pm,j), 7 )- The run then follows pi with the extra information in its control state. 
After Pi accepts, we transition to ((p 2 ,P 3 , 7 j ■ • ■ ,Pm,,l),l) and then replay p 2 - We repeat un¬ 
til all subtrees have been dispatched. This gives an accepting run of P outputting the same 
number of each a. 

In the other direction, we replay the accepting run p of P until we reach a configuration 
{{Pi,P2,'l, ■ ■ ■ ,Pm,'l),l) via a rule 


(P,cr) ^ {{pi,p2,'y, ■ ■ ■ ,Pm,l),rew^) 


At this point we apply 

(P,cr) ^ (pi,...,Pm,rewT,) 

of P. We obtain runs for each of the new children as follows. We split the remainder of the run 
p' into m parts p'l, ..., where the break points correspond to each application of a rule of 
the second kind. For each i we replay the transitions of p'l from {pi^j) to obtain a new run of 
P with fewer applications of the second rule. Inductively, we obtain an accepting run of P that 
we plug into the zth child. This gives us an accepting run of P outputting the same number of 
each a. □ 


6.6 Decidability of The Diagonal Problem 

We thus have the following theorem. 

Theorem 6.3 (Decidability of the Diagonal Problem). For an n-PDA P and output characters 
tti,..., Oa, it is decidable whether Diagonal^^.^ 

Proof. We first interpret P as an (n,a)-PDA and then construct via Lemma 1^31 (Reduction to 
Lower Orders) an (n — 1, a)-PDA P' such that Diagonab^ (P) iff Diagonab^ _„^(P'). We 
repeat this step until we have an (0,a)-PDA. Then, from Lemma [53] (Decidability at Order-0) 
we obtain decidability as required. □ 


7 Correctness for Simultaneous Unboundedness 

In this section we prove Lemma 16.21 (Correctness of P_i). The proof follows the same outline 
as the single character case. To show there is a run with at least m of each character, we take 
via Lemma 17.11 (Section I7.2|l , m' = (a + I)’", and a run of P outputting at least this many of 
each character. Then from Lemma IT^ fSection 17.311 a run of P_i outputting at least m of each 
character as required. The other direction is shown in Lemma 17.31 fSection 17.4p . 

We first generalise our tree decomposition and notion of scores. We then show that every 
a-branch subtree of a tree decomposition generates a run tree of P_i matching the scores of 
the tree. Finally we prove the opposite direction. 
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7.1 Tree Decomposition of Output Trees 

Given an output tree T of P where each push„ operation has a matching pop„ on all branches, 
we can construct a decomposed tree representation of the run inductively as follows. We define 
Tree(T[e]) = T[e] and, when 

T = b[Ti,... ,T^] 

where the rule applied at the root does not contain a push„ operation, we have 

Tree(T) = 6[Tree(ri),..., Tree(Tm)] . 


In the final case, let 

T = e[T'] 

where the rule applied at the root contains a push„ operation and the corresponding pop„ 
operations occur at nodes rji,... ,r]m- 

Note, if the output trees had an arbitrary number of branches, m may be unbounded. In 
our case, m < a, without which our reduction would fail: P_i would be unable to accurately 
count the number of pop„ nodes. In fact, our trees would have unbounded out degree and 
Lemma 14.11 (Minimum Scores) would not generalise. 

Let Ti,..., Tm be the output trees rooted at rji,... ,r]m respectively and let T' be T with 
these subtrees removed. Observe all branches of T are cut by this operation since the push„ 
must be matched on all branches. We define 

Tree(T) = £[Tree(T'), Tree(Ti),..., Tree(rj„)] . 

An accepting run of P has an extra pop„ operation at the end of each branch leading to 
the empty stack. Let T' be the tree obtained by removing the final pop„-induced edge leading 
to the leaves of each branch. The tree decomposition of an accepting run is 

Tree(r) = e[Tree(r'), T[e],..., T[£]] 

where there are as many r[£] as there are leaves of T. 

Notice that our trees have out-degree at most (a -I- I). 

7.2 Scoring Trees 

We score branches in the same way as the single character case. We simply define Scorea(p) to 
be Score(p) when a is considered as the only output character (all others are replaced with e). 

We have to slightly modify our minimum score lemma to accommodate the increased out- 
degree of the nodes in the trees. 

Lemma 7.1 (Minimum Scores). Given a tree T with maximum out-degree (a-|- 1), containing, 
for each a € {oi,..., Oa}, at least m nodes labelled a, for each a € {oi,..., Oq,} we have 

Scorea{T) > log(„+i)(TO) 

Proof. This is a simple extension of the proof of Lemma 14.11 (Minimum Scores). We simply 
replace the two-child case with a tree with up to (a -I- 1) children. In this case, we have to use 
log(„+ 2 ) rather than log to maintain the lemma. □ 
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7.3 From Branches to Runs 

Lemma 7.2 (Scores to Runs). Given an accepting output tree p of P, if for all a € {oi,..., Oq} 
we have Scorea{Tree(p)) > m, then 3T G C{P-i) with \T\a > m for all a € {oi,..., Oq,}. 

Proof. We will construct a tree p-i in C{P-i) top down. At each step we will maintain a 
“frontier” of p_i and extend one leaf of this frontier until the whole tree is constructed. The 
frontier is of the form 

... ,ci,rii,OiBi) 

which means that there are I nodes in the frontier. We have i?i l±l • • • l±) = {oi, ..., 00 -} and 

each Bi indicates that the ith branch, ending in configuration Ci, is responsible for outputting 
enough of each of the characters in Bi. Each rji is the corresponding node in Tree(p) that is 
being tracked by the ith branch of the output of P_i. 

Let pf he the final (accepting) control state of P and let T = Tree(p). We begin at the 
root node of T, which corresponds to the initial configuration of p. Let (p, [s]„) be this initial 
configuration and let c = {{p,pf,... ,pf, 0, {oi,..., Oq}), s) be the configuration of P_i after an 
application of a rule from Pinit- The initial frontier is (c, s, {oi,..., Oq,}). 

Thus, assume we have a frontier 

(ci, 7 ?i, Oi, Pi,..., Ch, Vh, Oh, Bh) 

and for each of the sequences c_i, rj, O, B of the frontier we have 

1. T' is the subtree of T rooted at 77 , and 

2 . c = (p, s) labelling ij, and 

3. c_i = ((p,pi,...,Pm,0,P),top„(s)), and 

4. the node of p corresponding to p has m locations where the top„ stack is first popped via 
rules reaching pi,... ,Pm, moreover, these leaves have corresponding leaves in T', and 

5. the branch from the root of the constructed run to the node labelled c_i in the frontier 
outputs, for each a G P, at least (m — Scoreo(T')) occurrences of a, and 

6 . O n P = 0 and for each a € O there is at least one node labelled by a in T'. 

Pick such a sequence c_i,? 7 , O, P. We replace this sequence using a transition of P_i in 
a way that produces a new frontier with the above properties and moves us a step closer to 
reaching leaves of T. There are three cases when we are dealing with internal nodes. 

, T = b[Ti]. 

In this case there is a transition c ^ c' via a rule (p, 7 ) \ (p', o) where o ^ {push„, pop„}. 
Hence, we have 


Up,P i, ■■■Pm, O, P), 7 , Att) ((p',Pl, . . .Pm, O \ {&}, P), o) 

in P_i and thus we can extend p_i with a transition c_i A- c'_i via this rule. The new 
frontier is obtained by replacing c_i, p, O, B with c'_i, p', O \ { 6 } , P where p' is the child 
of p. The properties on the frontier are easily seen to be retained. 
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• T' = e[Ti ,... ,Ti] from a rule {p, 7 ) {p'l, ■ ■ ■ ,p'i, rew^) of P. 

We separate B = W • • • W such that S' is the set of characters a that have their 
score derived from Tj (i.e. the subtree with the higher score for a characters). Let 
O' be the set of all a who had a +1 in their score derived from another subtree. Let 
(xi, s),... {xi, s) be the configurations labelling the root nodes r]Q,r]i,... ,r]i of these sub¬ 
trees. Let (j/i, s),..., {pj, s) be the configurations labelling the root nodes of the remaining 
subtrees. Since T' includes m leaves that are followed in p by pops to pi,... ,pm we can 
distribute these control states amongst the branches, obtaining 

x\,... .. .x\,... .. .y{,... ,yl. . 

Finally, we can distribute 

O U O' = U • • • U X* U Yi U • • • U Y,- 


amongst the subtrees Ti,... ,Ti since O can be distributed by assumption and we chose 
O' such that this can be done. 

From the runs corresponding to Ti,... ,Ti and our choices above we know that the tests 


€£( 



,..., c_i e/: A^ , 


. 



■KJ 


■A 

) 

L the 

rule 





)>7, 

yi,vi, 

17 ... 

n 

/ {xi,x\,... 
o'ns, / ^ 

. . . , 



Vi,Vl, 

> 

\ {xi,x\,.. 

X^- 

Wi, Bi) 



and obtain configurations c)_i,..., c!_i and a new frontier satisfying the required properties 
by replacing c-i,r],0, B with the sequence 

• T' = e[Ti ,... ,Ti] not from a rule (p, 7 ) -4 (p'^, ■. ■ ,p'i, rew^) of P. 

In this case we have that T' (subtree of the decomposition T) corresponds to a run tree 
Pt' that can be decomposed into 

— c[p'] with c' = (p', push„(s)) at the root of p' via a rule (p, 7 ) (p',push„) and I 
leaf nodes labelled c\,...,ci respectively, and 

— runs pi,... ,pi with the roots labelled c'l = (p^, s),...,c( = (pj, s) where, for each i, 
we have c* c' via a pop„ rule, and these are the first points s is seen along each 
branch, and 

— the leaves of pi,..., p; are the leaves of pT' ■ 

There are two cases depending on whether we send the HOPDA down the branch corre¬ 
sponding to the push. 

— We separate B = Hq l±) HJ • • • W B' such that S' is the set of characters a 
that have their score derived from Tj (i.e. the subtree with the higher score for 
a characters). Assume Ti is amongst these subtrees (and will get B'q). Let O' 
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be the set of all a who had a +1 in their score derived from another subtree. 
Let (p', push„(s)), (xi, s),... (cci, s) be the configurations labelling the root nodes 
Tji, ... ,r]i of these subtrees, with the first belonging to Ti. Let {yi, s),..., {yj, s) be 
the configurations labelling the root nodes of the remaining subtrees. Since T' has 
m leaves that are followed in p by pops to pi,..., Pm we can distribute these control 
states amongst the branches, obtaining 

x).yl, ...,yl^,...y{,...,yj. . 

We can also distribute 


o u O' = X u Xi u • • • u w u Fi u • • • u y,- 


amongst the subtrees Ti,..., T; with X belonging to Ti since O can be distributed 
by assumption and we chose O' such that this can be done. 


From the existence of the runs pi,..., pi we know c_i & C,[ i i ) > ■ • 

\ Pi)2/i ’ • • •)2/j j 


■, c_i G 


C A 


Hence, we apply to c_i the rule 


/ 



/ ip',p'i,- 

. ,p'i,X,Bq) , \ 

(PIPI,- 

.,Pm,0,B),"f, 

A ^ - 

o'nB 

- 

{xi,x\,.. 

■,Xj^,Xi,Bi) , 

' , rew..^ 

. . . , 

V 



V {xi,x\,. 

.,x].,X„B,) ) 


and obtain configurations c^i,c^i, ■ ■ ■ ,c'_i and a new frontier satisfying the required 
properties by replacing c_i, 77 , O, B with the sequence 


c_i, 770) Bq, c_i, ? 7 i, Xi, Bi,... c_i, Tji, Xi, . 


— We separate i? = W • • • W H' such that is the set of characters a that have their 
score derived from Tj (i.e. the subtree with the higher score for a characters). Assume 
Ti is not amongst these subtrees. Let O' be the set of all a who had a +1 in their score 
derived from another subtree. Let (xi, s),... {xi,s) be the configurations labelling 
the root nodes rji, ... ,r]i of these subtrees. Let (p',push„(s)), (pi, s),..., {pj, s) be 
the configurations labelling the root nodes of the remaining subtrees, with the first 
belonging to Ti. Since T' has m leaves that are followed in p by pops to pi,..., Pm 
we can distribute these control states amongst the branches, obtaining 

x\,...,x]^,...x\,..., x).y\,... ,y]^,.. .yi,... ,yl. . 

We can also distribute 


O U O' = Ai U • • • U X, U F U Fi U • • • U Fj 

amongst the subtrees Ti,... ,Ti with F belonging to Ti since O can be distributed 
by assumption and we chose O' such that this can be done. 

From the existence of p' we know that c_i G C{ A^, , , ) and from the existence 

\ p )Pi v,Piy 

of Pi,..., p; we also know c_i € £ ( A^l 1 1 ), ..., c_i € £( , , 
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Hence, we apply to c_i the rule 


/ 

V 


■•Pi 


n- • -n 

yj,y{,...,y\ 


n \ 


o'ns 
-> 


{xi,x\, 

{xi,x\ 


,x].,X,,B,) 



and obtain configurations and a new frontier satisfying the required 

properties by replacing c_i, 77 , O, B with the sequence 


Xi, B[,.. Xi, Bl . 


Finally, we reach a leaf node p with a run outputting the required number of as. We need 
to show that the run constructed is accepting. From the tree decomposition, we know that the 
corresponding node of p is immediately followed by a pop„. Thus, from our conditions on the 
frontier, we must have m = 1 and 0 = 0. We also have a rule (p, 7 ) (pi, pop„) and therefore 
((p,pi, 0 , H), 7 , Htt) (/, rew.^) with which we can complete the run of P_i as required. □ 


7.4 The Other Direction 

Finally, we need to show that each accepting run tree of P_i gives rise to an accepting run tree 
of P containing at least as many of each output character a. 

Lemma 7.3 (P_i to P). We have Diagonal^^^ ,^^{P-i) implies Diagonal^^^ 

Proof. Take an accepting run tree p_i of P_i. We show that there exists a corresponding run 
tree p of P outputting at least as many as. 

We maintain a frontier 

Cl,..., Ch 

of p_i and a run p of P “with holes” such that 

• there are h nodes of p labelled by ci,..., c?i respectively (these are the holes), and 

• each of these holes labelled c is the only child of a parent node labelled c' of P, and 

• for each corresponding pair c and c' we have 

— c' = (p, s), and 

— c = {{p,pi,...,Pm,0,B),top„{s)), and 

— the node labelled by c has m children with the ith child being labelled (pi, pop„(s)), 
and 

— all leaf nodes of p are accepting, and 

— for each aG{ai,...,aQ,} the number of a output by run tree of P is at least as many 
as on the branch of P_i to the configuration with a € P less 1 if a G O. 

Initially after a rule from Pinit we have the frontier c = ((p,p/,... ,p/, 0, {ai,..., Uq}), s) 
with corresponding run p of P being 

(P, MnM{Pfd]n)i---APfd]n)]]- 
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Pick a configuration c_i = ... ,Pm, O, B), top„(s)) of the frontier that is not a leaf 

of p-i and its corresponding node in p with parent labelled c = {p, s). Let p'_i be the subtree 
of P-i rooted at this configuration. 

We show how to extend the frontier closer to the leaves of p_i. There are several cases 
depending on the transition of P_i used to exit our chosen node. 

• p'_i= C-i and the rule applied is of the form 

{{p,Pi,...,Pm,0,B),j,Att) {{p',pi,...,pm,0\{b},B),o) . 


Let c'_i be the configuration labelling the root oi We have (p, 7 ) A {p' , 0 ) G TZ and 

o ^ {push„, pop„}. We can apply c \ c'. Let p be the node labelled c_i. We insert above 
p a node labelled c'. Then we change the label of p to c'_i. We keep the same children of 
p. This extended run maintains all properties as required. 

• p'-i = c_i ..., pLi] via a rule 

{xi,x\,.. .,x]^,Xi,Bi) , 

..., , rew^ 

T X^^ . . . j Xj^j X-i , Bi'j 

derived from some rule 

(p, 7 ) 4 {p[,... ,p'i,iew^) & n . 

In this case, we apply the above rule to p which means taking the node p labelled c 
and replacing its “hole” child with I new children. We need to rebuild the rest of the 
tree the from these nodes. These nodes have configurations {p[,s),..., {p'i,s). These 
control states are distributed between xi,... ,Xi and yi,... ,yj. Consider yi (the other 
y 2 ,... ,yj are identical). We have from the respective passed test that (j/i, s) has a run 
where the first popping of the top„ stack leads to configurations (yj, s),..., {yj^, s). We 
insert this run underneath the node corresponding to the yi. Since yl,..., yj^ appear 
amongst pi,..., pm we append the subtrees that appeared as the relevant children of the 
node labelled c_i to complete these branches. The remaining subtrees corresponding to 
pi,...,Pm are distributed amongst x),..., ,..., a;(,..., . Consider xi (the others are 

identical arguments), we have a new child labelled by ((xi, x},..., , Xi, Bi ), top„(s)). 

We take the subrees distributed to x},..., x]^ as children of this new child to satisfy the 
requirements. 

The new frontier replaces c_i with 


/ 


{p,Pl,.--,Pin,0,B),J, 


n... 

Y, 


n 


V 


A 




((xi, x(,. • ■, x]^, Xi,Bi) , top„(s)), 

. . . , 

((xj, xl,..., x)., Xj,Bj), top„(s)) 
which satisfies all properties as needed. 

• p(_i = c_i ..., pLi] via a rule 


/ 



/ ip',p'i,-- 

.. ,p'i, X, Bq) , \ 


..,Pm,o,B),j, n-'-n 

o'nB 
-> 

{xi,xl, .. 

■,Xj^,Xi,Bi) , 

' , rew..^ 

V 

/lb) 

yi,yl,---,ylj J 


\ (x*,xl,.. 

. ,xj.,Xi,B,) J 
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derived from some rule 

(p.7) ^ (p',push„) 

In this case, we apply the above rule to p. This means replacing the node labelled c_i 
with one labelled (p', push„(s)). This new node has a new child node with the label 

((p',pi,---,Po^,5o),top„(s)) . 


We need to add I children to this new “hole” node. 

These nodes have configurations {p'l, s),..., {p[, s) (since s = pop„(push„(s))). These 
control states are distributed between xi,... ,Xi and yi,... ,yj. Consider yi (the other 
y 2 , ■ ■ ■ ,yj are identical). We have from the passed test that (t/i, s) has a run where the 
first popping of the top„ stack leads to configurations (j/^,pop„(s)),..., ,pop„(s)). 

We append this run tree as a child of the node corresponding to yi. Since yl,..., y}^ 
appear amongst pi,...,Pm we append the relevant subtrees we had already constructed 
for these nodes to complete these branches with the required properties. 

Now consider xi (the other cases are symmetric). In this case we append a node la¬ 
belled ((xi, x),..., , Xi, Bi ), top„(s)) as a child of the node corresponding to xi. Since 

xl, appear amongst pi, ... ,Pm we append the relevant subtrees we had already 
constructed for these nodes to complete these branches with the required properties. 

The new frontier replaces c_i with 

Bo),top„(s)) 


and 

((xi, x),. • ■, x]^, Xi,Bi), top„(s)), 

. . . , 

{{xj,x \,... ,x).,Xi,B,),top„(s)) 
which satisfies all the required properties. 


p'_i = c_i ..., pbi] via a rule 

{p,pi,... ,Pm,0,B),'y, 

V 


t 1 - 1 

o'ns j 
-^ 

{ (xi,x),.. 

■,x]^,Xi,Bi) 

c 

c 

1 / 

.,x].,Xi,B,) 



V {Xi,xl,.. 



derived from some rule 

(p, 7 ) 4 (p',push„) 

In this case, we again apply the above rule to p. This means replacing the node labelled 
c_i with one labelled (p', push„(s)). Since we know the test p/ p/ passed we have 
a run popping the newly pushed stack to controls p'^,... ,p[. We set this run tree as the 
only child of the node whose label we replaced. This new tree has I leaves which we need 
to complete. 

These leaf nodes are completed using the same argument as the previous case. That 
is, they are labelled with configurations {p'i,s),..., {p'l, s). These control states are dis¬ 
tributed between xi,...,Xi and yi,... ,yj. Consider yi (the other y 2 ,... ,yj are identical). 
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We have from the passed test that {yi , s) has a run where the first popping of the top„ 
stack leads to configurations pop„(s)),..., , pop„(s)). We append this run tree 

as a child of the node corresponding to yi. Since y\,..., y}^ appear amongst pi,...,Pm 
we append the relevant subtrees we had already constructed for these nodes to complete 
these branches with the required properties. 

Now consider xi (the other cases are symmetric). In this case we append a node la¬ 
belled ((xi, a;},..., , Xi, Bi), top„(s)) as a child of the node corresponding to a;i. Since 

x\,..., appear amongst pi,...,pm we append the relevant subtrees we had already 
constructed for these nodes to complete these branches with the required properties. 

The new frontier replaces c_i with 

{{p',P'l,---,P'l,X,BQ),tOPn{s)) 

and 

{[xi,x \,... ,x]^,Xi,Bi),top„(s)), 

({xj ,x\,...,x’j.,X„Bi), top„ (s)) 
which satisfies all the required properties. 

• p'-i = c_i[(/,s)]. 

In this case c has the form 

((p,p',0,B),top„(s)) 

and there is a rule 

(p, 7 ) 4 (p',pop„) . 

We can remove the hole from p by applying this rule. That is, we remove the hole 
node, setting its parent to have its (only) child as its child. This is possible since by our 
conditions the child has the label (p^pop„(s)). We remove c_i from the frontier. 

Thus, the frontier moves towards the leaves of the tree and finally is empty. At this point we 
have an accepting run of P as required. To see that the run outputs enough of each character, 
one needs to observe that at each stage the tests and O component of the control state ensured 
at least one character output for each that appeared in some O' labelling a transition. Then, 
for characters output along branches followed were reproduced faithfully. □ 

8 Conclusions 

We have shown, using a recent result by Zetzsche, that the downward closures of languages 
defined by HOPDA are computable. We believe this to be a useful foundational result upon 
which new analyses may be based. Our result already has several immediate consequences, 
including separation by piecewise testability and asynchronous parameterised systems. 

Regarding the complexity of the approach. We are unaware of any complexity bounds im¬ 
plied by Zetzsche’s techniques. Due to the complexity of the reachability problem for HOPDA, 
the test automata may be a tower of exponentials of height n for HOPDA of order n. These 
test automata are built into the system before proceeding to reduce to order (n — 1). Thus, we 
may reach a tower of exponentials of height O(n^). 

A natural next step is to consider collapsible pushdown systems, which are equivalent to 
recursion schemes (without the safety constraint). However, it is not currently clear how to 
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generalise our techniques due to the non-local behaviour introduced by collapse. We may also 
try to adapt our techniques to a higher-order version of BS-automata [3] , which may be used, 
e.g., to check boundedness of resource usage for higher-order programs. 
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